Graph Grammar Induction as a Parser-Controlled Heuristic Search Process
نویسندگان
چکیده
A graph grammar is a generative description of a graph language (a possibly infinite set of graphs). In this paper, we present a novel algorithm for inducing a graph grammar from a given set of ‘positive’ and ‘negative’ graphs. The algorithm is guaranteed to produce a grammar that can generate all of the positive and none of the negative input graphs. Driven by a heuristic specific-to-general search process, the algorithm tries to find a small grammar that generalizes beyond the positive input set. During the search, the algorithm employs a graph grammar parser to eliminate the candidate grammars that can generate at least one negative input graph. We validate our method by inducing grammars for chemical structural formulas and flowcharts and thereby show its potential applicability to chemical engineering and visual programming.
منابع مشابه
Frontier Pruning for Shift-Reduce CCG Parsing
We apply the graph-structured stack (GSS) to shift-reduce parsing in a Combinatory Categorial Grammar (CCG) parser. This allows the shift-reduce parser to explore all possible parses in polynomial time without resorting to heuristics, such as beam search. The GSSbased shift-reduce parser is 34% slower than CKY in the finely-tuned C&C parser. We perform frontier pruning on the GSS, increasing th...
متن کاملGlr* : a Robust Parser for Spontaneously Spoken Language
This paper describes GLR*, a parsing system based on Tomita's Generalized LR parsing algorithm, that was designed to be robust to two particular types of extra-grammaticality: noise in the input, and limited grammar coverage. GLR* attempts to overcome these forms of extra-grammaticality by ignoring the unparsable words and fragments and conducting a search for the maximal subset of the original...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملA Heuristic Driven Chart-Parser for Attributed Node Labeled Graph Grammars and its Application to Feature Recognition in CIM
To integrate CA*-systems with other applications in the CIM world, one principal approach currently under development is the feature recognition process based on graph grammars. It enables any CIM component to recognize the higher-level entities the so-called features used in this component out of a lower-data exchange format, which might be the internal representation of a CAD system as well a...
متن کاملHeuristic Driven Chart-parsing
To integrate Computer Aided systems (CA*-systems) in the area of Mechanical Engineering (ME) with other applications in the world of Computer Integrated Manufacturing (CIM), one principal approach under development is based on feature representation. It enables any CA*-system to recognize the higher-level entities { the so-called features in ME { out of a lower-data exchange format. In this pap...
متن کامل